CDS
Accession Number | TCMCG075C14706 |
gbkey | CDS |
Protein Id | XP_007035526.2 |
Location | join(30424433..30424540,30425213..30425311,30425534..30425596,30425763..30425840,30426268..30426457,30426682..30426941,30427024..30427402,30427864..30427941,30428928..30429148,30429288..30429350,30430119..30430442,30430529..30430707,30430785..30431067,30431339..30431575,30431769..30431930,30432336..30432488,30432583..30432801,30433159..30433458,30433575..30433739) |
Gene | LOC18603464 |
GeneID | 18603464 |
Organism | Theobroma cacao |
Protein
Length | 1186aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_007035464.2 |
Definition | PREDICTED: protein ALWAYS EARLY 3 isoform X1 [Theobroma cacao] |
EGGNOG-MAPPER Annotation
COG_category | K |
Description | Protein ALWAYS EARLY |
KEGG_TC | - |
KEGG_Module | - |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE |
ko00000
[VIEW IN KEGG] ko00001 [VIEW IN KEGG] |
KEGG_ko |
ko:K21773
[VIEW IN KEGG] |
EC | - |
KEGG_Pathway |
ko04218
[VIEW IN KEGG] map04218 [VIEW IN KEGG] |
GOs | - |
Sequence
CDS: ATGGCGCCATCTAGAAAATCTAAAAGTGTAAATAAGAAGTTTTCTTATGTTAATGAGGTTGCTTCTAGTAAAGATGGAGATAGTAGTGCTAAGAGAAGCGGGCAACGGAAAAGGAAGTTGTCTGACATGTTAGGGCCTCAATGGACTAAGGAAGAGCTTGAGCGTTTCTATGAAGCGTATCGCAAGTATGGGAAAGATTGGAAGAAGGTTGCTACTGTGGTACGAAATCGATCTGTGGAAATGGTAGAAGCTCTGTACACTATGAATAGGGCCTACTTATCTCTCCCGGAAGGCACTGCTTCTGTGGTTGGACTCATAGCGATGATGACTGATCACTATTGTGTTATGGGAGGAAGTGATAGTGAACAAGAAAGCAATGAGGGCGTGGGAGCTTCTCGGAAACCTCAGAAGCGTAGTAGGGGAAAACTTCGAGATCAACCCTCTAAAAGTTTAGATAAGTCATTTCCTGATCTTTTGCAATTTCATTCAGCTGCATCAAGTTATGGTTGCTTGTCATTGTTGAAGAGGAGACGCTCTGAAAGTAGGCCCCGTGCTGTTGGAAAAAGGACTCCTCGTGTTCCTATTTCTTTTTCTCATGACAAAAACAAAGGAGAAAGGTACTTTTCACCTATTAGGCAGGGCATGAAACTAAAGGTGGATACCGTTGATGATGATGTTGCTCATGAGATAGCATTAGTTTTGACGGAGGCATCACAAAGAGGTGGATCTCCTCAAGTTTCTCGAACACCAAACAGAAAAGCAGAGGCATCTTCACCTATTCTCAACAGTGAAAGGATGAATGCTGAGTCAGAAACTACTAGTGCCAAGATTCATGGTAGTGAAATGGATGAGGATGCTTGTGAATTGAGCTTAGGAAGCACTGAAGCTGATAATGCTGATTATGCTAGAGGTAAAAATTATTCAATGAATATAGAAGGGACTGGTACCATTGAAGTTCAACAGAAGGGAAAAAGATACTACAGAAGGAAGCCAGGGGTTGAGGAAAGTGTAAACAATCATCTGGAAGACACAAAAGAAGCCTGTAGTGGGACGGAAGAAGATCAAAAGTTATGTGATTTCAAGGGAAAGTTTGAAGCAGAGGTTGCAGATACCAAACCTTCTAGAGGCTCCATCAAGGGTCTAAGGAAAAGAAGTAAAAAAGTGTTGTTTGGGAGAGTTGAAGACACTTCCTTTGATGCCCTGCAAACTCTAGCAGATCTGTCCTTGATGATGCCAGAAACTGCTGCTGATACTGAGTCATCTGTGCAGTTCAAGGAAGAGAAAAATGAAGTTGTTGAGAAGACTAAACTGAAAGGAAACCATCCTGTTTCTGGAGCTAAAGGCACTGCCCCCAAAACATGTAAACAGGGAAAAGTTTTTGGTCATGATGTTCGTGCTATTCCCGAGGCAAAGGAGGAAACACACCCAGGTAATGTTGGAATGCGGAAAAGGAGACAGAAGTCCTCACCATATAAATTGCAGATTCCAAAAGATGAAACTGATGCTGATTCTCATTTGGGTGAATCTCGAAACATTGAGGCTTTAGATGAGGTAAAGAATTTTCCAAGCAAAGGTAAACGCTCTAATAATGTTGCACATTCAAAGCAAGGGAAATCAGTGAGACCTCCAGAGCATCGTTCCTCAAGTACTGATCATGGAAGGGACTTGAACAATTCAGCTCCATCTACCATACAGGTTTCACCTGTTAACCAGGTCAACCTACCCACAAAAGTCAGGAGTAAGAGAAAGATAGATGCACAGAAACAAGTGATTGGGAAGGATATAAAGTCCTCTGATGGTATTGTGAAGGGAAAATTTAGTGTTCCAGTTAGTTTATTCCATGACAGAGCACTCAATCTGAAGGAAAAGCTTTGTAACTTCCTATGTCCATATCAAGCACGGAGATGGTGTACCTTTGAGTGGTTCTGTAGTACAATTGATTATCCATGGTTTGCTAAAAGGGAGTTTGTGGAGTATTTGGATCATGTAGGATTGGGTCATGTTCCAAGATTAACTCGTGTTGAATGGGGTGTCATAAGGAGTTCCCTTGGCAAGCCACGAAGGTTTTCTGAGCAATTTTTGAAGGAAGAAAGAGAGAAGCTTTATCAATATCGGGAATCTGTTAGAACGCATTATGCTGAACTCCGTGCTGGTATTGGTGAAGGACTTCCAACTGATTTAGCTCGACCTCTATCAGTTGGACAGCGTGTTATTGCTATTCATCCAAAAACTAGAGAGATTCATGATGGAAATGTGTTAATTGTTGACCATAGTAGGTACCGGATTCAATTTGACAGCACTGAGCTAGGAGTGGAATCTGTCATGGATATTGATTGTATGGCTTTAAATCCATTGGAAAATTTGCCTGCTTCCCTTGTGAGACAAAATGCTGCTGTCAGGAAATTTTTTGAGAACTACAATGAGCTCAAAATGAACGGGCAGCCAAAAGAAAGCAAGATGGAAGAGAACATCAAATTTGCTTCGTGTGAGGAGAATGCCAATAGTCCCTCTCGAACTTCCCCATCAACTTTCAGTGTTGGCAATTTATCACAACTTGTTAAGGTTGATCCATCAAGTCCTAATTTACAACTTAAAGTTGGGCCTATGGAAACTGTTTATACTCAGCAGGCAGTAAATTCCCAGCCTTCTGCTCTGGCGCTGATACAGGCGAGGGAAGCTGATGTTGAAGCTCTTTCTCAGTTGACTCGTGCTCTTGACAAAAAGCATTTGCAGGAGGCTGTGGTCTCTGAACTACGGCGTATGAATGATGAGGTGTTGGAAAACCAGAAAGGTGGGGACAACTCTATAAAGGATTCAGATTCTTTCAAGAAGCAATATGCTGCTGTTCTTTTACAGTTAAATGAAGTCAATGAGCAGGTTTCTTCTGCTCTCTTTTCCTTGAGGCAACGCAATACATATCAAGGGACCTCCTCAGTTAGATTGCTGAAGCCCTTGGCTAAAATTGGTGAGCATGGTTGTCAGTTGAGCTCTTTTGATCATTCTATGCATCATGCCCAAGAATCTGTATCCCATGTGGCTGAAATTGTTGAAAGTTCAAGAACGAAAGCTCGGTCAATGGTGGATGCAGCTATGCAGGCTATGTCATCCTTGAGAAAAGGGGGGAAAAGCATCGAGAGGATTGAGGACGCAATAGATTTTGTAAATAACCAGCTTTCGGTGGATGATCTTAGTGTGCCTGCTCCGCGGTCTTCTATCCCAATAGACTCAGCCCACAGTACGGTAACTTTTCACGATCATCTCACTGCCTTTGTGTCAAATCCACTGGCAACTGGTCATGCACCTGATACAAAGTTGCAAAATTCGTCTGACCAAGACGATCTTAGAATCCCTTCAGACCTTATCGTGCATTGTGTAGCCACCTTGCTCATGATTCAGAAGTGTACAGAAAGGCAGTTTCCACCTGGAGATGTTGCCCAGGTACTAGATTCTGCTGTTACTAGTTTGAAGCCGTGTTGTTCACAAAATCTCTCAATTTATGCAGAGATACAGAAATGTATGGGAATTATTAGGAACCAGATATTGGCGCTGGTACCTACATAG |
Protein: MAPSRKSKSVNKKFSYVNEVASSKDGDSSAKRSGQRKRKLSDMLGPQWTKEELERFYEAYRKYGKDWKKVATVVRNRSVEMVEALYTMNRAYLSLPEGTASVVGLIAMMTDHYCVMGGSDSEQESNEGVGASRKPQKRSRGKLRDQPSKSLDKSFPDLLQFHSAASSYGCLSLLKRRRSESRPRAVGKRTPRVPISFSHDKNKGERYFSPIRQGMKLKVDTVDDDVAHEIALVLTEASQRGGSPQVSRTPNRKAEASSPILNSERMNAESETTSAKIHGSEMDEDACELSLGSTEADNADYARGKNYSMNIEGTGTIEVQQKGKRYYRRKPGVEESVNNHLEDTKEACSGTEEDQKLCDFKGKFEAEVADTKPSRGSIKGLRKRSKKVLFGRVEDTSFDALQTLADLSLMMPETAADTESSVQFKEEKNEVVEKTKLKGNHPVSGAKGTAPKTCKQGKVFGHDVRAIPEAKEETHPGNVGMRKRRQKSSPYKLQIPKDETDADSHLGESRNIEALDEVKNFPSKGKRSNNVAHSKQGKSVRPPEHRSSSTDHGRDLNNSAPSTIQVSPVNQVNLPTKVRSKRKIDAQKQVIGKDIKSSDGIVKGKFSVPVSLFHDRALNLKEKLCNFLCPYQARRWCTFEWFCSTIDYPWFAKREFVEYLDHVGLGHVPRLTRVEWGVIRSSLGKPRRFSEQFLKEEREKLYQYRESVRTHYAELRAGIGEGLPTDLARPLSVGQRVIAIHPKTREIHDGNVLIVDHSRYRIQFDSTELGVESVMDIDCMALNPLENLPASLVRQNAAVRKFFENYNELKMNGQPKESKMEENIKFASCEENANSPSRTSPSTFSVGNLSQLVKVDPSSPNLQLKVGPMETVYTQQAVNSQPSALALIQAREADVEALSQLTRALDKKHLQEAVVSELRRMNDEVLENQKGGDNSIKDSDSFKKQYAAVLLQLNEVNEQVSSALFSLRQRNTYQGTSSVRLLKPLAKIGEHGCQLSSFDHSMHHAQESVSHVAEIVESSRTKARSMVDAAMQAMSSLRKGGKSIERIEDAIDFVNNQLSVDDLSVPAPRSSIPIDSAHSTVTFHDHLTAFVSNPLATGHAPDTKLQNSSDQDDLRIPSDLIVHCVATLLMIQKCTERQFPPGDVAQVLDSAVTSLKPCCSQNLSIYAEIQKCMGIIRNQILALVPT |